The Modulation Transfer Function for Speech Intelligibility

Authors

  • Taffeta M. Elliott
  • Frédéric E. Theunissen
Abstract

We systematically determined which spectrotemporal modulations in speech are necessary for comprehension by human listeners. Speech comprehension has been shown to be robust to spectral and temporal degradations, but the specific relevance of particular degradations is arguable due to the complexity of the joint spectral and temporal information in the speech signal. We applied a novel modulation filtering technique to recorded sentences to restrict acoustic information quantitatively and to obtain a joint spectrotemporal modulation transfer function for speech comprehension, the speech MTF. For American English, the speech MTF showed the criticality of low modulation frequencies in both time and frequency. Comprehension was significantly impaired when temporal modulations <12 Hz or spectral modulations <4 cycles/kHz were removed. More specifically, the MTF was bandpass in temporal modulations and low-pass in spectral modulations: temporal modulations from 1 to 7 Hz and spectral modulations <1 cycles/kHz were the most important. We evaluated the importance of spectrotemporal modulations for vocal gender identification and found a different region of interest: removing spectral modulations between 3 and 7 cycles/kHz significantly increases gender misidentifications of female speakers. The determination of the speech MTF furnishes an additional method for producing speech signals with reduced bandwidth but high intelligibility. Such compression could be used for audio applications such as file compression or noise removal and for clinical applications such as signal processing for cochlear implants.
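The modulation filtering idea in the abstract can be illustrated with a minimal sketch. It assumes the modulation spectrum is taken as the 2D Fourier transform of a log-amplitude spectrogram and that filtering means zeroing components outside a chosen region. The function name `modulation_lowpass`, its parameters, and the toy spectrogram are hypothetical and not taken from the paper; resynthesizing audio from the filtered spectrogram is not covered here.

```python
import numpy as np

def modulation_lowpass(log_spec, frame_rate_hz, freq_step_khz,
                       max_temporal_hz, max_spectral_cyc_per_khz):
    """Low-pass a log spectrogram in the joint modulation domain.

    log_spec is a 2D array of shape (n_freq_bins, n_time_frames).
    All names and the hard-edged mask are illustrative assumptions.
    """
    n_freq, n_time = log_spec.shape
    # Modulation spectrum: 2D Fourier transform of the log spectrogram.
    mod_spec = np.fft.fft2(log_spec)
    # Spectral modulation axis in cycles/kHz, temporal modulation axis in Hz.
    spectral_mod = np.fft.fftfreq(n_freq, d=freq_step_khz)
    temporal_mod = np.fft.fftfreq(n_time, d=1.0 / frame_rate_hz)
    # Keep only components inside both cutoffs (mask is symmetric in sign).
    keep = ((np.abs(spectral_mod)[:, None] <= max_spectral_cyc_per_khz)
            & (np.abs(temporal_mod)[None, :] <= max_temporal_hz))
    # A symmetric mask on a real input yields a real result up to rounding.
    return np.fft.ifft2(mod_spec * keep).real

# Toy spectrogram: 32 bins spaced 0.125 kHz apart, 100 frames at 100 frames/s.
t = np.arange(100) / 100.0
f = np.arange(32) * 0.125
slow = np.cos(2 * np.pi * 1.0 * f)[:, None] * np.cos(2 * np.pi * 4.0 * t)[None, :]
fast = np.ones((32, 1)) * np.cos(2 * np.pi * 30.0 * t)[None, :]
out = modulation_lowpass(slow + fast, 100.0, 0.125, 12.0, 4.0)
```

With cutoffs of 12 Hz and 4 cycles/kHz (the values quoted in the abstract), the 4 Hz ripple in the toy input is retained and the 30 Hz ripple is removed. A hard-edged mask is the simplest choice for a sketch; a practical filter would likely use a smooth transition to limit ringing.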

Similar resources

Designing modulation filters for improving speech intelligibility in reverberant environments

In this paper, we propose a new technique to design modulation filters to reduce degradation of speech intelligibility in reverberant environments. Using the inverse modulation transfer function, we design data-derived modulation filters for each speech frequency band. These filters preprocess speech signals between a microphone and a loudspeaker that radiates speech into a performance hall. Us...

Effects of suppressing steady-state portions of speech on intelligibility in reverberant environments

1. Introduction When listening to a lecture in a large auditorium it is often difficult to understand the speech. Among other factors, comprehension may be impaired by reverberation, which is sound reflecting from the wall, interfering with direct sound. Based on the modulation transfer function (MTF), the speech transmission index (STI) has been proposed as an objective measure for speech inte...

Speech Dynamics

This keynote presentation is about the various dynamic aspects of speech (energy envelope, spectral variation, voicing and pitch variation, speaking style and pronunciation variation, and influence of communication channel). The related speech signal characteristics are measured and modeled and are tested in listening experiments. Consequences for speech recognition and speech synthesis are dis...

A binaural microscopic model based on a modulation filter bank for predicting speech intelligibility in normal-hearing listeners

In this study, a binaural microscopic model for the prediction of speech intelligibility based on the modulation filter bank is introduced. So far, the spectral criteria such as the STI and SII or other analytical methods have been used in the binaural models to determine the binaural intelligibility. In the proposed model, unlike all models of binaural intelligibility prediction, an automatic ...

Refinement of an MTF-based speech dereverberation method using an optimal inverse-MTF filter

We previously proposed a speech dereverberation method based on the modulation transfer function (MTF). This method consists of power envelope restoration and carrier regeneration processes, and reduces both the loss due to degraded power envelopes and the loss of speech intelligibility. In the power envelope restoration, however, whether adaptive time-frequency division provides the best repre...

Journal:
  • PLoS Computational Biology

Volume 5

Publication date: 2009